Decision Tree Algorithm Considering Distances Between Classes
نویسندگان
چکیده
Decision tree algorithm (DT) is a commonly used data mining method for classification and regression. DT repeatedly divides dataset into pure subsets based on impurity measurements such as entropy Gini. Then relatively “pure” partitions consisting of observations with the (almost) same class are obtained. Gini index one representative indices measuring data. However, does not take account distances between classes. If classes considered when impurity, decision can distinguish clearly different To end, new Rao-Stirling proposed considering considers in way that weights more to pairs references distant impurity. Experimental results indicate superior terms accuracy, implying help improve accuracy DT.
منابع مشابه
On Distances between Isomorphism Classes of Graphs
Various measures of distance between (the isomorphism classes of) two graphs have been proposed. These include measures proposed by Zelinka [6],[7], Baláž, Koča, Kvasńıčka & Sekanina [1] and Johnson [4]. Further, there is the edge rotation distance of Chartrand, Saba & Zou [3] and the edge shift distance of Johnson [5]. It is on the latter two that we build. For terminology and notation not exp...
متن کاملA hybrid genetic algorithm / decision tree approach for coping with unbalanced classes
This paper proposes a new approach for coping with the problem of unbalanced classes, where some class(es) is(are) much less frequent than the other(s). The proposed approach is a hybrid genetic algorithm / decision tree system. The genetic algorithm acts as a wrapper, using the output of a decision tree algorithm (the state-of-the-art C5.0) to compute the fitness of population individuals (can...
متن کاملStructure and Majority Classes in Decision Tree Learning
To provide good classification accuracy on unseen examples, a decision tree, learned by an algorithm such as ID3, must have sufficient structure and also identify the correct majority class in each of its leaves. If there are inadequacies in respect of either of these, the tree will have a percentage classification rate below that of the maximum possible for the domain, namely (100 Bayes error ...
متن کاملGlobal Tree Optimization: A Non-greedy Decision Tree Algorithm
A non-greedy approach for constructing globally optimal multivariate decision trees with xed structure is proposed. Previous greedy tree construction algorithms are locally optimal in that they optimize some splitting criterion at each decision node, typically one node at a time. In contrast, global tree optimization explicitly considers all decisions in the tree concurrently. An iterative line...
متن کاملCUDT: A CUDA Based Decision Tree Algorithm
Decision tree is one of the famous classification methods in data mining. Many researches have been proposed, which were focusing on improving the performance of decision tree. However, those algorithms are developed and run on traditional distributed systems. Obviously the latency could not be improved while processing huge data generated by ubiquitous sensing node in the era without new techn...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Access
سال: 2022
ISSN: ['2169-3536']
DOI: https://doi.org/10.1109/access.2022.3187172